Study on parameters of the variable threshold to detect local speech rate deceleration in Japanese spontaneous conversational speech
نویسندگان
چکیده
1. Introduction In human communication, speech conveys not only linguistic information but also emphasis, intention, attitude and so on. They are called paralinguistic information [1]. There are several researches on paralinguistic information [2,3]. Methods for modeling or detecting of paralinguistic information is useful for various application in man-machine communication such as speech synthesis with rich expressions and recognition of paralinguistic information in spontaneous speech. A speaker controls prosodic features such as fundamental frequency, power and temporal structures to express paralinguistic information. It is said that there are few speech rate variations in Japanese read speech. In spontaneous conversational speech, however, a speaker sometimes controls speech rate greatly to obtain a listener's attention. We previously found that speech rate of important words or portions of sentences is slowed to obtain the listener's attention [4]. In order to understand paralinguistic information using a computer, it is one of important issues to detect portions of sentences in which the speaker intentionally decelerates the speech rate. There are several studies on local speech rate variation [5–7]. However, there are few studies on detection of local speech rate variation. We try to detect a local slower portion from a time series of mora duration [4,8]. When the speech rate of one portion is slower than that of other portions, the mora duration is longer than the durations of other morae. However, it is known that variation in time series of mora duration is caused not only by intentionally controlled speech rate variation but also by other factors such as difference of phonemes, length of a phrase or a sentence and a position of a mora in a phrase or sentence [9]. We have proposed the variable threshold (VT) [8] for detecting a local slower portion decelerated by a speaker from observed mora duration. The VT is applied to time series of mora duration. A mora whose duration exceeds the VT is detected as a local slower portion. The outline of the VT is described in section 2. In this paper, we examine the properties of parameters in the VT that are used for determining range and speed of variation of the VT. Three sets of parameters are prepared. We assume that these sets of parameters correspond to the levels of a listener's attention to
منابع مشابه
Evaluation of the method to detect Japanese local speech rate deceleration applying the variable threshold with a constant term
We are aiming to detect local deceleration of Japanese spontaneous conversational speech. We have proposed the variable threshold (VT), which detects local speech rate deceleration from the sequence of time series of mora duration. In this paper, we add a constant term to the VT to detect local deceleration appropriately. The VT is applied to 167 samples of Japanese spontaneous speech taken fro...
متن کاملDetecting Japanese local speech rate deceleration in spontaneous conversational speech using a variable threshold
The variable threshold(VT), which detects the speech rate deceleration, is proposed. The VT varies dynamically depending upon the duration of previous mora in the utterance. The VT should not change rapidly because listener cannot perceive small variations of mora duration. Thus, a set of functions with time constants which decide response speed of the VT is introduced. We apply the VT to six s...
متن کاملA Fundamental Study on a Method to Detect Slower Phrases in Japanese Dialog Speech
A slower phrase in spontaneous conversational speech is caused by emphasis, thinking during speaking and so on. To include such useful information with man-machine communication, we investigate a method to detect local slower phrase from time sequence of mora duration in Japanese dialog speech. At first we prepare speech samples, which contains phrases slowed considerably. Then the flow of the ...
متن کاملEvaluation of a threshold for detecting local slower phrases in Japanese spontaneous conversational speech
I have proposed a method for detecting local slower phrases in Japanese spontaneous conversational speech. A threshold is applied to phrase-averaged mora duration in this method. It is considered that relative variation of time sequence of phrase-averaged mora duration should be taken into account for detecting slower phrases correctly. In this paper, preliminary experiments are carried out to ...
متن کاملAcoustic variability in spontaneous conversational speech of american English talkers
Speaker variability strongly impacts human perception and technology performance, yet large-scale, systematic study of the acoustic characteristics involved is rarely undertaken. This study provides statistics on selected segmental and suprasegmental acoustic parameters from measures made on spontaneous conversational telephone speech from 160 speakers in the Switchboard Corpus. Since spontaneo...
متن کامل